Genta Indra Winata

Shammie

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Jan 25, 2026
Via arXiv

PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

Jan 24, 2026
Via arXiv

Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection

Jan 14, 2026
Via arXiv

Can Large Language Models Understand, Reason About, and Generate Code-Switched Text?

Jan 12, 2026
Via arXiv

Leveraging Parameter Space Symmetries for Reasoning Skill Transfer in LLMs

Nov 13, 2025
Via arXiv

Optimizing Reasoning Efficiency through Prompt Difficulty Prediction

Nov 05, 2025
Via arXiv

Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language Representations

Sep 05, 2025
Via arXiv

SEADialogues: A Multilingual Culturally Grounded Multi-turn Dialogue Dataset on Southeast Asian Languages

Aug 09, 2025
Via arXiv

IndoPref: A Multi-Domain Pairwise Preference Dataset for Indonesian

Jul 29, 2025
Via arXiv

Language Surgery in Multilingual Large Language Models

Jun 14, 2025
Via arXiv